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It is often reasonable to assume that the dependence structure of a bivariate continuous dis- 
tribution belongs to the class of extreme-value copulas. The latter are characterized by their 
Pickands dependence function. In this paper, a procedure is proposed for testing whether this 
function belongs to a given parametric family. The test is based on a Cramer-von Mises statistic 
measuring the distance between an estimate of the parametric Pickands dependence function 
and either one of two nonparametric estimators thereof studied by Genest and Segers [Ann. 
Statist. 37 (2009) 2990-3022]. As the limiting distribution of the test statistic depends on un- 
known parameters, it must be estimated via a parametric bootstrap procedure, the validity of 
which is established. Monte Carlo simulations are used to assess the power of the test and an 
extension to dependence structures that are left-tail decreasing in both variables is considered. 

Keywords: extreme- value copula; goodness of fit; parametric bootstrap; Pickands dependence 
function; rank-based inference 

1. Introduction 

Let X and Y be continuous random variables with cumulative distribution functions F 
and G, respectively. Following Sklar [36], the joint behavior of the pair (X,Y) can be 
characterized at every (x, y) £ R 2 by the relation 

H(x, y) = Pr(A < x, Y < y) = C{F(x),G(y)} (1) 

through a unique copula C that captures the dependence between X and Y. 
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When H is known, its marginal distributions can easily be retrieved from it. The copula 
can also be readily identified as it is simply the joint distribution of the pair (U,V) = 
(F(X),G(Y)). In practice, however, H is often unknown, and the relation between X 
and Y must be modeled from data. 

A copula model for H assumes that equation (1) holds for some F, G and C from 
specific parametric classes. This approach was used, for example, by Frees and Valdez 
[11] and Klugman and Parsa [25] to analyze data from the Insurance Services Office, 
Inc. on the indemnity payment (X) and allocated loss adjustment expense (Y) for 1500 
general liability claims randomly chosen from late settlement lags. Based on their work 
and subsequent analysis by other authors, it is reasonable to assume that for these data, 
F is inverse paralogistic, G is Pareto and C is a Gumbel-Hougaard extreme- value copula. 

Extreme- value copulas characterize the limiting dependence structure of suitably nor- 
malized componentwise maxima. They are of special interest in insurance [7], finance 
[6, 29] and hydrology [34], where the occurrence of joint extremes is a risk management 
concern. 

Pickands [31] showed that if C is a bivariate extreme-value copula, then 



C(u, v) = exp 



\og(uv)A 



logQ) 
log(w) 



(2) 



for all u, v G (0, 1) and a mapping A : [0, 1] — s- [1/2, 1], referred to as the Pickands depen- 
dence function, which is convex and such that max(i, 1 — t) < A(t) < 1 for all t G [0, 1]. 
For instance, an extreme-value copula is said to belong to the Gumbel-Hougaard family 
if there exists 9 G [l,oo) such that for all t G [0, 1], we have 

A{t) = {t e + {l-t) e } 1 ' 9 . (3) 

A test that a copula C is of the form (2) was developed by Ghoudi et al. [20]; it was 
recently refined by Ben Ghorbal et al. [1]. Under the assumption that C is an extreme- 
value copula, it may be of interest to check whether its Pickands dependence function A 
belongs to a specific parametric class, say A = {Ag : 9 G O}, where O is an open subset 
of MP for some integer p. 

The purpose of this paper is to examine how the hypothesis Hq : Ag A can be tested 
with a random sample {X\,Y\), . . . , (X n ,Y n ) from H. As for all goodness-of-fit tests 
reviewed by Berg [2] and Genest et al. [18], the proposed procedure is based on pseudo- 
observations (Ui, Vi), . . . , (U n , V n ) from copula C, defined, for i G {1, . . . ,n}, by 

Ui = F n (Xi), V i =G n (Y i ), (4) 

where F n and G n are rescaled empirical counterparts of F and G, respectively, given by 

Fn(x) = —— < x), G n (y) = —— < V) 

71+1 f—' 71+1 t— ' 
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for all i,j £ I. This approach is justified because, as copulas themselves, the pairs 
(Ui, Vi), . . . , (U n , V n ) of normalized ranks are invariant under strictly increasing transfor- 
mations of X and Y. As shown by Kim et al. [24], it also leads to efficient and robust 
estimators. 

The proposed test is described in Section 2 and its asymptotic null distribution is given 
in Section 3, where a parametric bootstrap is proposed for the calculation of P-values. In 
Section 4, the distributional result is extended to alternatives that are left-tail decreasing 
in both variables. This is instrumental in studying the consistency and power of the test, 
which are considered in Sections 5 and 6, respectively. The paper concludes with an 
illustrative example. Technical proofs are grouped in a series of appendices. 

All procedures discussed herein are implemented in the R package copula [38] available 
via the Comprehensive R Archive Network at http://cran.r-project.org. 

2. Proposed goodness-of-fit test 

Let (Xi, Yi), . . . , {X ni Y n ) be a random sample from some unknown continuous bivariate 
distribution H whose underlying copula is of the form (2) with Pickands dependence 
function A. In order to test the hypothesis 



a natural way to proceed is to compare a nonparametric estimator A n of A to a parametric 
estimator Ag n . Several measures of distance can be used for this purpose, but the Cramer- 
von Mises statistic 



Jo 

generally leads to more powerful tests than, say, the Kolmogorov-Smirnov statistic [18]. 
The choices of Ag n and A n are discussed next. 

2.1. Parametric estimation of A 

Under Hq, Ag may be estimated by Ag n using a consistent estimate 6 n of 9. Such an 
estimate can be derived from the pairs (U\, V\), . . . , (U n , V n ) via the maximum pseudo- 
likelihood method considered by Genest et al. [14] and Shih and Louis [35]. 

To illustrate this approach in a concrete case, let Ag be the generator of the Gumbel- 
Hougaard copula defined in (3). For all u,v £ (0, 1), write 



H :AeA = {A e :6eO}, 




(5) 




= exp[-{|log(u)| e + |log( U )|V A >]. 
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As Ag is twice differentiable on (0,1), the copula Cg has a density given by cg(u,v) = 
d 2 Cg(u,v)/dudv everywhere on (0, l) 2 . The maximum pseudo-likelihood estimator n is 
then the value 9 £ O = (l,oo) at which the function 

n 

e(6) = Y,log{c e (U i ,V i )} 

i=l 

reaches its global maximum. An advantage of this method is that it can be used even 
when the parameter space O is multidimensional. 

When 9 is real- valued, a simpler technique which also yields a consistent estimator is 
based on the inversion of Kendall's tau. As shown by Ghoudi et al. [20], the relation 

r(C) = -1 + A J] C(u, v) dC(u, v) = I dA'(t) 

is valid for any extreme- value copula C. When A £ A, r is a function of 9 and a rank- 
based moment estimate of the latter is obtained by solving the equation r n = t(9) for 
9, where r„ is the sample value of Kendall's tau. In the Gumbcl-Hougaard model, for 
instance, we find t{9) = 1 — 1/9 and hence 9 n = max{l, 1/(1 — r„)}. 

When O C R, we can also obtain consistent, rank-based estimates of 9 by exploiting 
its one-to-one relationship with other nonparametric measures of dependence such as 
Spearman's rho, that is, 

p{C) = -'i + 12 II uvdC(u,v)=-l + 1 J^pdi- 



2.2. Nonparametric estimation of A 



Nonparametric estimators of A are proposed by Genest and Segers [19]. For i £ {1, . . . , n}, 
set &(0) = - log(Ui), = - log(Fi) and 

-log(Pi) -log(Vi) l 
1-t ' t 



£i(t) = minj 

for all t £ (0, 1), where Ui and Vi are as in equation (4). Also, let 



{i " 1 1 " 

- E ^) . A « FG (*) = ex p -7 - - E i°gtfi(*)} 
i=l J L i=l 

where 7 = — J °° log(a;)e _:!: da: w 0.577 is Euler's constant. 

The functions A^ and ^4„ are rank-based versions of the estimators of A introduced 
by Pickands [31] and Caperaa et al. [4], respectively. As noted by Genest and Segers [19], 
these estimators can be altered to meet the end-point conditions -A^(O) = j4^ fg (0) = 1 
and = A^ FG (1) = 1. However, this makes no difference asymptotically. 
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Both A F and ^4„ can be expressed as functionals of the empirical copula, which may 
be defined for all u,v G [0, 1] by 



1 n 

C n {u,v) = -Y\{U l <u,V l <v). 



n . 
i=l 



To be specific, the following relations hold for all t G [0,1]: 



A F (t) = l/ j C^s 1 -',*') 
A GFG (t) = exp|- 7 + J {Cnix 1 -*,!*) - l(x > e" 1 



dx 

x 

dx 



)} 



xlog(x) 



It was shown by Riischendorf [33] that under weak regularity conditions, the process 
y/n(C n — C) converges in law to a Gaussian limit C, that is, y/n(C n -C)^»Casr»->oo. 
We may thus expect A F and A% FG to be consistent and asymptotically Gaussian. This 
is shown by Genest and Segers [19], provided that A is twice continuously differentiablc. 
Their Theorem 3.2 states that 

A F = ^{Al —A)~* A p , A CFG = ^{A GFG —A)~* A CFG 

as n —> oo in C[0, 1], where, for all t G [0, 1], 

dx 



A^(t) = -A z (t) I Cix'-^x*) 

dx 



A GFG (t) = A(t) / Ctf-^x*) 



x 

i 



xlog(a;) 

Remark. Observe that, in principle, the statistics S F and S GFG could be extended 
to arbitrary dimension d > 3 because d-variate extreme-value copulas are characterized 
by (d — l)-place Pickands dependence functions [10]. At present, however, multivariate 
analogs of the rank-based estimators A F and ^4„ FG are unavailable. To see how the 
estimation can proceed in the d-variate case when the marginal distributions are known, 
refer to [39] or [21]. 



3. Asymptotic null distribution of the test statistic 

The asymptotic distribution of the goodness-of-fit statistic S n depends on the joint be- 
havior of 9„ = \/n(9 n — 9) and either A F or A GFG under Hq. Suppose that the class 
A = {Ag : 9 G O} satisfies the following conditions: 

(A) the parameter space O is an open subset of M p ; 

(B) for every 9 GO, Ag is twice continuously differentiable on (0, 1); 
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(C) the gradient Ag(t) of Ag(t) with respect to 8 satisfies 

lim sup sup \\A e -(t)-A e (t)\\-+0, (6) 

e4-0 ||0*-0||<ete[O,l] 

where || • || denotes the ^2-norm. 

As is proved in Appendix A, the process ^ n ,e„ = y / n(A„ — Ag n ) is then asymptotically 
Gaussian, both when A n = A p and A n = A CFG ■ 

Proposition 1. Assume Hq holds, that is, C is an extreme-value copula with Pickands 
dependence function A = Ag for some 9q G O . Further, assume that A = {Ag :9 G O} 
meets conditions (A) -(C). 

(a) If (A F ,0„) converges to a Gaussian limit (A p ,0), then A n ,6 n A p — AJ as 
ti — ^ oo in C[0, 1]. 

(b) If (A CFG ,0„) converges to a Gaussian limit (A CFG ,0), then A n< g n A CFG - 
A J o as n oo in C[0, 1] . 

The weak convergence of the statistic S n defined in (5) follows immediately from 
Proposition 1 and the continuous mapping theorem (see, e.g., [37], Theorem 1.3.6). As 
the limit depends on the unknown parameter value 9q, we must resort to resampling 
techniques to carry out the test. The following parametric bootstrap procedure can be 
used to this end. Its validity depends on regularity conditions adapted from [17]. These 
conditions, listed in Appendix B, can be verified for many families of extreme-value 
copulas. 

Parametric bootstrap procedure 

(1) Compute An from the pairs (U\, V\), ■ ■ ■ , (U n , V n ) of normalized ranks and estimate 
9 using a rank-based estimator, as discussed in Section 2. 

(2) Compute the test statistic S n defined in (5). 

(3) For some large integer TV, repeat the following steps for every k € {1, ... , N}: 

(3.1) generate a random sample {X\k,Y\k), ■ ■ ■ , (X n k,Y n k) from copula Cg n and 
deduce the associated pairs (fife, Vifc), • ■ • , (U n k, V„k) of normalized ranks; 

(3.2) let A n k and 6 n k stand for the versions of A n and 9 n derived from the pairs 
(Ulk, Vik), . . . , (U n k, V n k); 

(3.3) compute 

Snk = [ n\A nk (t) - A 6nk (t)\ 2 dt. 
Jo 

(4) An approximate P-value for the test is given by A -1 J2k=i ^(^nk > S n ). 

4. Extension to left-tail decreasing copulas 

The statistic S n can be used to build goodness-of-fit tests for the more general hypothesis 



H*:CeC = {C g :9eO}, 
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where C is a parametric family of copulas that are left-tail decreasing (LTD) in both 
arguments. From [30], Exercise 5.35, a copula C is LTD in this sense if and only if, for 
all < u < u' < 1 and < v < v' < 1, 



C(u,v) C(u',v') 



(7) 



This condition is satisfied for extreme- value copulas, which Garralda-Guillcm [13] showed 
to be stochastically increasing in both variables. 

The following result, proved in Appendix C, implies that when C is an LTD copula, 
A F and A CFG are consistent, asymptotically Gaussian estimators of A c and A^ FG , 
respectively, where, for all t £ [0, 1], 



A c (t) = l/ [ C(x^\x^ 
1 Jo x 



and 



Ag FG (i)=exp 



{C{x 1 -\x t )-l{x>c- 1 )} 



dx 



x\og(x) 



Proposition 2. Suppose that the copula C has a continuous density and satisfies condi- 
tion (7). Then y/n~{A F - A c ) ~* and Vn{A GFG - Ag FG ) •w Ag FG as n-> oo in C[0, 1], 
where, for all t € [0, 1] , 

Ag(t) = ~{A P c (t)r f'c^.x')-, 
Jo x 

Ag FG W = Ag FG (i) fc^,,')-^. 

Jo xlog(a;) 

Incidentally, the mappings A^~, and ^4g FG are well defined for any copula C, whether 
or not it is LTD. They reduce to the Pickands dependence function A when C is of 
the form (2). Otherwise, they typically differ from one another, but retain some of the 
properties of Pickands dependence functions. These facts are summarized in the following 
proposition, the proof of which is left to the reader. 

Proposition 3. Let C be a copula and let Ac denote either A c or A C FG . Also, let W 
and M denote the lower and upper Frechet-Hoeffding bounds, respectively. The following 
statements then hold: 

(a) A w (t) > A c (t) > A M {t) = max(i, 1 - 1) for all t e [0, 1]; 

(b) ifC{u,v) > uv for all u,v G [0, 1]. then A c (t) < 1 for all t G [0, 1]; 

(c) if C{u,v) = C{v,u) for all u,v £ [0, 1], then A c {i) = A c (l - t) for all t E [0, 1]; 

(d) if C is an extreme-value copula with Pickands dependence function A, then Ac = 
A. 
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The bounds Ayy, A 1 ^ and = A c f G are depicted in the left panel of Figure 1. 
As a further example, consider the Farlie-Gumbel-Morgenstern copula with parameter 
9 e [-1, 1], defined for all u,v£ [0, 1] by C s (u,v) = uv + 9uv(l - u)(l - v). Condition (7) 
is met if 9 > and it is easy to check that for all t <G [0,1], 

a P / *\ " ^ /i CFG ( \ ( ^ *\ 

A> W=2i2-2i-4+(3i 2 -3i)0' e W=^2 + t^t2j ' (8) 

These functions are graphed in the right panel of Figure 1. 

Invoking Proposition 2, we can proceed as in Appendix A to show the convergence of 
the goodness-of-fit process in the case of LTD copulas, whence the following result. The 
parametric bootstrap algorithm described in Section 3 also applies mutatis mutandis and 
remains valid under such Hq. 

Proposition 4. Assume Hq holds, that is, C is an LTD copula such that C = Cg for 
some 6 eO. LetA p = {A ¥ c :CeC} and A C¥G = {A GFG : C e C}. 

(a) If AF meets conditions (A) -(C) and (A F ,0„) converges to a Gaussian limit 
(A F , 6), then A n , Sn ~* A£, — Aj g Q as n -> oo in C[0, 1]. 

(b) If A meets conditions (A)-(C) and (A c , 0„) converges to a Gaussian limit 
(Ag FG ,9), then A„ Ai ~* Ag FG - AJ Q Q as n -> oo mC[0,l]. 

5. Consistency of the test 

Suppose that C £ C is an LTD copula and that the hypothesis Hq :CgC is being tested 
with the Cramer-von Mises statistic S n . Let A n denote either A F or A GFG and let A 
stand for A^, or A^ FG , as the case may be. Further, assume that 8 n is a consistent, 



Bounds 




Figure 1. Left panel: graph of the bounds A^r (top curve), ^4^° (middle curve) and 
A V M = A^f G (bottom curve) . Right panel: graph of Aq (dashed) and A^F° (dotted) for the 
Farlie-Gumbel-Morgenstern copula with 6=1/2 (upper curves) and 1 (lower curves). 
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rank-based estimator of some 9* £ O. The test based on S n is then consistent, provided 
that A ^ A g , . 

To see this, decompose the process A ni g n as 

y/n{A n - A 9n ) = VTl(A n -A)- Vn~(A 6n -A e *) + Vn(A - A e , ). (9) 

Assume conditions (A)-(C) hold for A = A F or ,4 CFG and that as n — > oo, (y/n(A n — 
A),y/n(6 n — 0*)) ~** (A, 0*) to a Gaussian limit, where A stands for either A p or A CFG . 
We can then proceed exactly as in Appendix A to see that as n — > oo, y/n(A n — A) — 
y/n(A 9n - Ag,) A- Aj,Q*. If A ^ A e *, then sup tg[0 ^ y/n\A(t) - Ag,{t)\ oo and 
hence, for every e > 0, 

lim Pr(S' n > e) = 1. 

n— ^oo 

In particular, the test based on S n is consistent whenever C is an extreme- value copula 
and the hypothesized family C also consists of extreme-value copulas. However, consis- 
tency may fail otherwise, for it may happen that A = Ag* , even if Hq is false. 

To illustrate this point, consider the functions A F and A^ FG given in (8). As the latter 
are convex, they can be used to generate new families of extreme-value copulas, which 
may be called the FGM-P and FGM-CFG families. 

Now, suppose that C is the Farlie-Gumbel-Morgenstern copula with parameter 9 > 
and that the statistic S n is used to test Hq :Ag A when: 

(a) A is the Gumbel-Hougaard family of copulas; 

(b) A is the FGM-CFG family of extreme-value copulas. 

In case (a) , the tests based on A F and A GFG would be consistent because A^ and A CFG 
both differ from the Pickands dependence function of the Gumbel-Hougaard given in (3). 
In case (b), the test based on A F would also be consistent because A p ^ A CFG ■ The test 
based on A GFG may fail to be consistent, however, given that A CFG coincides with the 
Pickands dependence function of the FGM-CFG family. Consistency of the test would 
then depend on the behavior of 9 n . 

Suppose, for instance, that 9 is estimated by inversion of Kendall's tau. As n — > oo, 
9 n would approach 29/9, which is the population value of this dependence measure for 
the FGM copula. For the FGM-CFG family, however, Kendall's tau is 70/10 + 6> 2 /30, 
which coincides with 20/9 only when = 0, that is, at independence where the differ- 
ence between the two models is immaterial. Therefore, the test based on A GFG would 
be consistent in this case, provided that is estimated by inversion of Kendall's tau. 
A similar conclusion would be reached for inversion of Spearman's rho and maximum 
pseudo-likelihood estimation. 

6. Power study 

Equation (9) and the accompanying discussion suggest that just as for consistency, the 
power of the test based on S n depends on how different A = A^ or A GFG is from its 
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t = 0.25 t=0.5 x = 0.75 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 

t t t 



Figure 2. Pickands dependence functions of the Gumbel-Hougaard, Galambos, Hiisler-Reiss 
and t-EV copulas when r = 0.25, r = 0.50 and r = 0.75. 

parametric estimate Ag* under Hq. This issue is investigated graphically in Section 6.1 
and via simulations in Sections 6.2 and 6.3. 

6.1. General considerations 

Consider the following three sets of LTD copula families. 

Group I: Symmetric extreme-value copulas: the Gumbel-Hougaard (GH), Galambos 
(GA), Hiisler— Reiss (HR) and Student extreme- value (t-EV) copula with four degrees 
of freedom. 

Group II: Symmetric non-extreme-value copulas: the Clayton (C), Frank (F), Normal 
(N) and Plackett (P). 

Group III: Asymmetric extreme-value copulas: asymmetric versions of the Gumbel- 
Hougaard (a-GH), Galambos (a-GA), Hiisler-Reiss (a-HR), and Student extreme- value 
(a- t-EV) copula with four degrees of freedom. 

Figure 2 shows the Pickands dependence functions of the copulas in Group I when 
r = 0.25, 0.50, 0.75. Although the curves are not identical, they are very similar. When 
the statistic S„ is used to distinguish between these models, therefore, the test will be 
consistent, but can be expected to have little power, even in moderate sample sizes. 

In Figure 3, the functions A^, and ^4§ FG are plotted for the copulas in Group II 
and the same values of tau. For comparison purposes, the curve corresponding to the 
Gumbel Hougaard copula is added. Here, the differences between the curves are much 
more pronounced. Thus, the power of the test based on S n may be expected to rise 
quickly (and be approximately the same) if the copula family under Ho is from Group I. 

Figure 4 shows the Pickands dependence functions of the copulas in Group III. These 
copulas were derived using Khoudraji's device [15, 23, 28], which transforms any sym- 
metric copula Ce into a non-exchangeable model via the formula 



C x ^o(u,v) = u 1 - x v 1 - K C e (u\v K ) 
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for all ii,w£ [0, 1] and arbitrary choices of A ^ k £ (0, 1). Furthermore, if Cg is an extreme- 
value copula with Pickands dependence function Ag, then C\. Ki g is also an extreme- value 
copula. Its Pickands dependence function is given, at all t G [0, 1], by 



Aa,m(*) = i 1 ~ K )t + (1 - A)(l - + {nt + A(l - t)}A t 



H't 



Kt + \(l-t) 



Note that the dependence in C\ >Kt g is limited since, by the Frechet-Hoeffding inequal- 
ity, 

Cx, K ,e(u,v) < u 1 ~ x v 1 ~ K min(V ,v K ) = mm(uv 1 ^ K , vu 1- *). 

As the right-hand term is the Marshall-Olkin copula MOa, k , Example 5.5 in [30], implies 
that 

T{C X , K .g) < t(MO a k) - ■ 1 X . ■ 
K + A — kA 



t = 0.25 t = 0.5 t = 0.75 




0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 




Figure 3. Plots of Ac (top) and A§ FG (bottom) when C is the Gumbel-Hougaard (GH), 
Clayton (C), Frank (F), Normal (N) and Plackett (P) copula with r = 0.25 (left), r = 0.50 
(middle) and r = 0.75 (right). 
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t = 0.2 



q 



d 




< 



CO 

d 



d 



GH 

a-GH 

a-GA 

a-HR 

a-t-EV 



CD 

d 



0.0 



0.2 



0.4 



0.6 



0.8 



1.0 



Figure 4. Pickands dependence functions for the Gumbel-Hougaard copula and four asymmet- 
ric extreme- value copulas with r = 0.20: the asymmetric Gumbel-Hougaard (a-GH), Galambos 
(a-GA), Hiisler-Reiss (a-HR) and t-EV (a-t-EV) with four degrees of freedom. 

In the present study, the values A = 0.3, k = 0.8 were used and, hence, t(C\, k .b) could 
not exceed 0.279. For each choice of copula family Cg in Group III, the parameter 9 was 
set to make Kendall's tau equal to 0.20. 

Figure 4 shows that the Pickands dependence functions of the copulas in Group III 
are very similar, though distinct. They are, however, easily distinguished from their 
symmetric counterparts with the same value of tau. Thus, although these extreme-value 
copulas would be difficult to tell apart on the basis of S n in moderate samples, the test 
may still be reasonably powerful against copulas in Group I. 

6.2. Monte Carlo study 

The observations in Section 6.1 were confirmed through simulations. To this end, 1000 
random samples of size n = 300 were generated from 28 different copulas, C, correspond- 
ing to the following scenarios: 

(a) C belongs to Group I or II and r(C) G {0.25,0.50,0.75}; 

(b) C belongs to Group III and r(C) = 0.20. 

The statistics and S„ FG were computed for each data set. Four hypotheses of the form 
Hq : ^4 G A were then tested. The choices for A were the families of Pickands dependence 
functions for extreme-value copulas in Group I. 

All tests were carried out at the 5% level. Each P-value was computed on the basis of 
JV = 1000 parametric bootstrap samples. For comparison purposes, goodness of fit was 
also checked with the general purpose statistic 



T n = \C n (Ui,Vi) - CeSU>,V{)\ 2 . 



i=i 
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Table 1. Percentage of rejection of Ho for copulas in Group I when n = 300 



-on 


TV no 
11 IK. 




T — U.ZO 






T — u.ou 






t — (17r; 




T„ 


nP 


qCFG 


T„ 




oCFG 


T„ 


C-P 


oCFG 


V i 1 1 


VI I 1 


A 9 


O.O 




a n 


A 8 


o . u 


A 9 


p; i 


p; ^ 




GA 


4.8 


4.3 


4.2 


4.4 


3.8 


3.8 


4.8 


4.3 


4.3 




HR 


4.8 


4.2 


4.0 


5.4 


3.4 


3.9 


3.7 


3.1 


1.7 




t-EV 


4.2 


3.8 


4.5 


5.1 


5.5 


6.4 


4.8 


7.5 


8.9 


GA 


GH 


4.5 


4.7 


3.9 


4.0 


5.8 


4.7 


4.4 


5.6 


6.8 




GA 


4.3 


4.6 


4.0 


5.5 


3.9 


4.8 


4.3 


4.7 


4.6 




HR 


4.6 


4.8 


4.2 


5.0 


3.4 


3.4 


3.7 


3.7 


1.8 




t-EV 


4.6 


4.7 


4.4 


5.3 


8.0 


7.1 


5.7 


8.2 


10.9 


HR 


GH 


4.6 


6.4 


4.4 


4.3 


9.6 


7.5 


4.5 


9.6 


15.7 




GA 


4.3 


5.4 


4.5 


5.1 


6.6 


7.2 


5.1 


8.4 


11.7 




HR 


4.9 


5.2 


4.2 


5.3 


4.3 


3.9 


4.0 


4.3 


3.3 




t-EV 


4.6 


5.9 


4.8 


5.8 


13.7 


11.5 


6.6 


14.9 


29.3 


t-EV 


GH 


4.2 


3.4 


4.0 


4.1 


3.9 


2.9 


4.0 


3.3 


2.4 




GA 


4.1 


4.3 


4.4 


4.8 


3.4 


3.9 


4.6 


3.0 


1.7 




HR 


4.7 


4.1 


4.4 


5.4 


3.2 


3.4 


3.8 


2.2 


1.3 




t-EV 


4.6 


3.7 


4.2 


4.7 


4.8 


5.2 


4.1 


4.3 


4.7 



This particular test statistic was chosen because of its good overall performance in the 
large scale simulation studies of Berg [2] and Genest et al. [18]. 

Tables 1-4 report the percentages of rejection of the four null hypotheses under each 
scenario. Although this made little difference, these results are for the end-point-corrected 
versions of and ^ FG , defined for all t G [0, f] by 

1/AlM = l/Al{t) (1 - t){l/A p n (Q) 1} - t{l/Al{\) 1} 

and 

\ogAZ G {t) = \og{A^ G (t)} - (1 - 1) log{^ F G (0)} _ tlog{A^ FG (l)}. 

Before commenting on the results, note that for copulas in Groups I and II, the real- 
valued dependence parameter of each data set was estimated by inversion of Kendall's 
tau; its implementation relied on the numerical approximation technique of Kojadinovic 
and Yan [26]. For copulas in Group III, which involve several parameters, maximum 
pscudo- likelihood estimation was used [14, 35]. 

6.3. Results 

It is clear from Table 1 that when n = 300, the tests based on T n , S p and S CFG cannot 
distinguish between copulas in Group I. When r = 0.25, all rejection rates are within 
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Table 2. Percentage of rejection of Ho for copulas in Group I when n = 1000 





TV-no 
_LI Lie 
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T — U.OU 
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7 — U. lO 




Tn 


rfP 


oCFG 


Tn 


o n 


oCFG 


T n 


qP 


oCFG 


V i 1 1 


pTT 


A Q 


A 7 


o . o 


a n 






1 8 




• t . i 




GA 


5.1 


5.8 


4.0 


5.9 


4.4 


5.1 


4.8 


3.8 


4.2 




HR 


5.1 


6.3 


6.3 


5.1 


6.3 


0.0 


3.4 


3.5 


9.2 




t-EV 


5.4 


4.4 


5.4 


6.1 


6.2 


6.9 


5.4 


9.8 


15.9 


GA 


GH 


5.2 


7.4 


6.1 


4.4 


8.1 


8.4 


4.3 


5.6 


7.3 




GA 


5.0 


5.6 


4.0 


5.4 


5.1 


5.4 


4.8 


4.4 


5.2 




HR 


4.4 


5.0 


5.2 


4.5 


4.5 


6.2 


3.5 


3.1 


6.2 




t-EV 


6.1 


6.9 


6.6 


6.7 


9.4 


12.7 


5.5 


12.8 


23.1 


HR 


GH 


6.2 


10.6 


8.6 


5.1 


17.6 


17.8 


5.5 


18.1 


40.2 




GA 


5.4 


6.6 


4.1 


5.6 


8.1 


8.8 


5.5 


12.7 


23.4 




HR 


4.6 


5.9 


5.5 


4.2 


4.9 


5.1 


3.4 


4.7 


5.6 




t-EV 


6.6 


10.1 


8.2 


8.2 


27.0 


34.4 


0.5 


45.2 


81.7 


t-EV 


GH 


4.7 


4.7 


5.3 


4.4 


4.4 


5.7 


4.0 


3.4 


3.0 




GA 


4.8 


5.6 


4.2 


5.6 


4.8 


6.0 


4.8 


3.0 


3.7 




HR 


5.3 


6.4 


6.1 


5.5 


8.5 


12.3 


4.3 


3.9 


28.7 




t-EV 


5.1 


4.5 


5.5 


5.6 


4.8 


5.3 


5.2 


4.5 


4.7 



sampling error from the nominal level. There are only small signs of improvement as r 
rises to 0.50 and 0.75. The best scores are obtained when testing for the Hiisler-Reiss 
model with S^ FG when r = 0.75. Globally, there is little to choose between the tests. 

Table 2 shows what happens when n = 1000. Power is on the rise, especially when 
r = 0.75. In the latter case, it seems preferable to base the test on 5„ rather than on 
S^" - both do better than the test based on T n . Overall, the results remain disappointingly 
low, except when testing for the Hiislcr-Rciss model with r > 0.50. 

These observations are in line with Figure 2, which shows striking similarities between 
the Gumbel-Hougaard, Galambos, Hiislcr-Reiss and t-EV copula with four degrees of 
freedom. While S F and S G still have difficulty telling them apart when the sample 
size is 1000, their power eventually rises when n — > oo, as explained in Section 5. To 
illustrate this point, samples of various sizes were generated from the Gumbel-Hougaard 
copula with r = 0.50 and the statistic S GFG was used to test for the Galambos family. 
The following results, based on 1000 repetitions and = 1000 bootstrap samples, give 
an idea of the sample sizes needed to differentiate models in Group I: 



Sample size n 


5 000 


10 000 


20000 


40 000 


Percentage of rejection of Hq 


10.8 


22.6 


60.2 


97.3 



Returning to the case n = 300, we can see from Table 3 that the test based on S GFG 
is quite good at detecting non-extreme- value LTD alternatives from Group II. Its power 
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Table 3. Percentage of rejection of Ho for copulas in Group II when n = 300 
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N 


26.9 


21.9 


21.8 


43.5 


44.9 
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18.7 


82.3 




P 


34.3 


17.3 


43.4 


68.0 


44.6 


98.6 


65.0 


71.6 


100.0 


GA 


C 


98.9 


99.7 


84.0 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 




F 


39.8 


15.1 


50.1 


83.4 


10.2 


100.0 


92.1 


29.9 


100.0 




N 


28.1 


25.7 


21.9 


44.0 


49.0 


69.5 


37.4 


21.5 


83.2 




P 


37.7 


23.4 


45.0 


70.8 


57.1 


99.0 


65.7 


76.7 


100.0 


HR 


C 


99.1 


99.9 


84.5 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 




F 


42.3 


18.8 


52.5 


85.2 


18.9 


100.0 


93.7 


42.1 


100.0 




N 


28.3 


29.0 


22.5 


46.0 


55.7 


73.3 


38.9 


34.8 


89.8 




P 


41.1 


28.8 


48.3 


75.1 


74.5 


99.5 


73.1 


92.1 


100.0 


t-EV 
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98.6 


99.5 


82.6 


100.0 


100.0 


100.0 


100.0 


100.0 


100.0 




F 


36.7 


11.1 


48.3 


81.3 


5.0 


100.0 


90.9 


18.9 


100.0 




N 


26.5 


21.8 


21.7 


43.5 


42.7 


66.2 


36.9 


10.6 


74.2 




P 


34.8 


17.2 


43.7 


67.7 


35.7 


98.1 


62.1 


53.2 


99.8 



Table 4. Percentage of rejection of Ho for copulas in Group III when n = 300 



Ho 


True 


T n 


oP 
'~ n 


c-CFG 


GH 


a-GH 


32.7 


40.9 


86.5 




a-GA 


33.5 


42.8 


86.7 




a-HR 


28.4 


37.5 


83.5 




a-t-EV 


33.1 


41.4 


88.6 


GA 


a-GH 


33.4 


40.8 


89.2 




a-GA 


34.0 


42.3 


89.3 




a-HR 


28.4 


38.1 


86.5 




a-t-EV 


32.7 


40.6 


90.5 


HR 


a-GH 


36.2 


37.5 


93.3 




a-GA 


31.7 


39.1 


89.7 




a-HR 


32.6 


40.9 


90.3 




a-t-EV 


40.5 


42.8 


92.3 


t-EV 


a-GH 


32.0 


41.2 


87.1 




a-GA 


33.2 


43.3 


87.8 




a-HR 


27.3 


38.4 


83.6 




a-t-EV 


31.3 


40.7 


88.7 
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is higher than those of S% and T n , except when the data are generated from the Clayton 
or the Normal copula with r = 0.25. Interestingly, the general purpose test based on T n 
is often second best. The statistic S£ has the edge only for the Clayton when r = 0.25; 
it does very poorly against the Frank, and against the Normal when r = 0.75. 

These results are in close agreement with the plots displayed in Figure 3. Consider, for 
instance, the case where S„ is used to test for the Gumbcl- Hougaard copula from weakly 
dependent data (r = 0.25). From Table 3, the alternatives can be ranked as follows in 
decreasing order of power: 

Clayton >- Normal y Plackett y Frank. 

Looking at Figure 3, we find that this ordering is concordant with the overall degree of 
dissimilarity between A^, and A. In this case, as in others, it is found that at fixed sample 
size, curves that look alike are harder to distinguish than others. 

Finally, Table 4 shows that the statistic S„ is much better than the other two at 
detecting asymmetric extreme-value alternatives. The overall good performance of this 
test is consistent with evidence from [19] that A^ FG is generally a better nonparametric 
estimator of the Pickands dependence function than A^ . When the margins are known, 
this phenomenon is well documented; see, for example, [4, 22] or [32]. 

7. Conclusion 

Copula models are now common. As illustrated, for instance, by Ben Ghorbal et al. 
[1], so are situations in which the dependence structure of a random pair (X,Y) is 
well represented by an extreme-value copula, even though X and Y themselves do not 
necessarily exhibit extreme-value behavior. In such cases, the statistics considered here 
can be used to test the goodness of fit of specific parametric copula families of the form 
(2) such as the Gumbcl-Hougaard, Galambos, Hiisler-Reiss or Student extreme-value 
copula. 

Theoretical and empirical evidence presented here shows that the nonparametric tests 
based on the Cramer-von Mises statistic S n are generally consistent and that they are 
an effective tool for distinguishing between symmetric and asymmetric extreme-value 
copulas, as well as for detecting other left-tail decreasing (LTD) dependence structures. 

Except in the presence of massive data, however, it seems very difficult to discriminate 
between extreme- value copulas whose Pickands dependence functions arc close. This may 
come as something of a disappointment, but, on reflection, we may wonder whether, in 
the light of Figure 2, there is any practical difference between, say, the Gumbel-Hougaard 
and the Galambos copula when they have the same value of Kendall's tau. 

For example, many studies have concluded that a Gumbel-Hougaard copula structure 
is adequate for the insurance data mentioned in the Introduction; see, for instance, [5, 

8, 9, 11, 15, 16] or [27]. In these papers, comparisons were made between the Gumbel- 
Hougaard model and non-extreme- value copulas that were either Archimedean or meta- 
elliptical. 
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As Ben Ghorbal et al. [1] conclude that the data exhibit extreme-value dependence, 
it may be worth comparing the Gumbcl Hougaard structure with other extreme- value 
copulas from Groups I and III. This is done in Table 5 using the statistics and £ CFG 
and the inversion of Kendall's tau to estimate 9. Because the test is yet to be adapted to 
the case of censoring, the analysis ignored the 34 claims for which the policy limit was 
reached. Each P-value in the table is based on N = 2500 bootstrap samples. Given the 
comparatively small sample size, n = 1466, it is little wonder that no model is rejected 
at the 5% level. 

Figure 5 displays the end-point-corrccted estimates A„ c and A GF C G for the data at 
hand. For comparison, the best-fitting symmetric and asymmetric Galambos extreme- 
value copulas are superimposed. Although these two models yield the highest P- values, 
they are not significantly better than the alternatives listed in Table 5. Given the estima- 
tors' sampling variability, the data set is simply too small to distinguish between them. 
This is not a major concern, however, as predictions derived from these various models 
would be roughly the same. To paraphrase Box and Draper ( [3] , page 424) , it may be that 
all these models are false, but they are nearly equivalent and probably equally useful. 

Appendix A: Proof of Proposition 1 

Let A„ denote either A F or A CFG and write A n ^ n = A„ — M n> g , where M rit g a = y r n(Ag n — 
Ag ). As the sequence 0„ is assumed to converge weakly, it is tight. Thus, for given <5 > 0, 
there exists L = L(8) such that Pr(||0„|| > L) < S holds for every integer n. Therefore, 
for given ( > 0, 

Pr{ sup \M n< g {t)-AJ o (t)Q n \>t} 




~i 1 1 1 1 r ° n 1 1 1 1 r 

0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 



Figure 5. Nonparametric estimates A^ c and A^j^F , and fitted Pickands dependence function, 
for the Galambos copula (left) and asymmetric Galambos copula (right). 
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Table 5. Values of the statistics S„, S„ FG and approximate P- values computed using N = 2500 
parametric bootstrap samples for the insurance data 



Model 




P-value 


qCFG 


P-value 


GH 


0.087 


0.073 


0.048 


0.171 


GA 


0.084 


0.074 


0.045 


0.184 


HR 


0.088 


0.067 


0.049 


0.157 


t-EV 


0.088 


0.069 


0.048 


0.166 


a-GH 


0.052 


0.274 


0.012 


0.152 


a-GA 


0.046 


0.325 


0.009 


0.244 


a-HR 


0.051 


0.272 


0.011 


0.174 


a-t-EV 


0.062 


0.204 


0.015 


0.122 



<Pr{ sup \M n ,e (t) - AJ (t)Q n \>(,\\Q n \\< l\ +Pr{\\e n \\> L) 
L te[o,i] J 

<Pr{ sup \m n ,e (t) - A] o (t)e n \> c\\e n \\ < l\ + s. 

L te[o,i] J 

An application of the mean value theorem then implies that for every realization to 
of the process and every t G [0, 1], M n! g (t,ui) = Aq, , t Jt)Q n {ui), where 0*(i,w) = 9 + 

e(t,(jj)n~ 1 / 2 <d n (u}) for some e(t,ui) € [0, 1]. It then follows from condition (6) that 

lim p r { sup \m n f (t) - AJ o {t)e n \ >c,l|e„|| <l\ 

n^oc L te [ ,l] > 

< lim Pr{||e„|| sup |i e , (t) (t)-i eo (t)|>C,||e„||<L) 
n ^°° L te[o,i] J 

< lim Pr{ sup sup \Ag(t) - A 6o (t)\ >(/l}=0. 
n^oo i-||e_e ||< n -i/3 L t e [o,i] > 

This completes the argument. 

Appendix B: Validity of the parametric bootstrap 

To avoid repetitions, let A„ denote either or A„ and let A stand for either A p or 
A CFG . The following conditions, adapted from [17], ensure the validity of the parametric 
bootstrap for computing P-values for the proposed tests. 

(a) The family {Cq : 9 € O} of extreme-value copulas must be such that: 

(i) the parameter space O is an open subset of W; 

(ii) members of the family are identifiable, that is, for every e > 0, 

inf( sup \\Ag(t)-Ag (t)\\:eeO and \\6 - 6» j| > e\ > 0; 
L te[oa] J 
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(iii) the mapping 8 i-> Ag is Frechet diffcrcntiable with derivative 8 H> Ag, that 
is, for all 8 £ O, 

\\Ag a + h {t) - Ag (t) - Al(t)h\\ 

hm sup — = 0; 

l|fc||^»te[o,i] ll'HI 

(iv) Cg has a Lebesgue density eg for all 8 £ O; 

(v) the density eg admits first- and second-order derivatives with respect to all 
components of 8 £ O; the gradient (row) vector with respect to 8 is denoted 
eg and the Hessian matrix is denoted eg; 

(vi) for arbitrary (u, v) £ (0, l) 2 and every 8 £ O, 9t-> cg(u,v)/cg(u, v) and 8 M> 
cg(u,v)/cg(u,v) are continuous at 8q, Cg almost surely; 

(vii) for every 6$ £ O, there exist a neighborhood AT of do and a Lebesgue inte- 
grable function h : (0, l) 2 — > R such that sup egA ^ ||cg(u, v)\\ < h(u, v) holds for 
all (u,v) £ (0,1) 2 ; 

(viii) for every 8q £ O, there exist a neighborhood N of 8q and Ce -intcgrablc 
functions h 1 ,h 2 : (0, l) 2 -> R such that for all (u, v) £ (0, l) 2 , 



sup 



Cg(u,v) 



Cg(u,v) 



< /ii(u, u) and 



sup 



cg{u,v) 



Cg{u,v) 



< h 2 (u,v). 



(b) In addition, the estimators A n and 8 n satisfy the following: 

(i) (A„,6 n ,W„) w (A,6,W) in X>([0,1],R) x R?® 2 as n-^oo, where the limit is 
a centered Gaussian process. Here, 



for a random sample (J7* ,V*), . . . , (£7*, V^) from Cg a and W is 7V(0, Ip), where 
Jp is the Fisher information matrix; see [17], page 1101. 
(ii) E eo (6W T ) = J, where J is the p x p identity matrix. Further, E eo {A(i)W} = 
Ag (t) for every t £ (0, 1). 

Condition (b) can be checked as follows, under the assumption that (A„, 0„) ~+ (A, 0) 
as n — > oo. First, results from Chapter 5 of [12] can be combined with the functional 
delta method (see, e.g., [37], Section 3.9) to see that as n — > oo, (A ra ,0 n ,C„, W„) ~+ 

(A,e,c,w). 

Next, observe that E eo {C(u,u)W} = Cg (u,v) for all u,v £ [0, 1]; see [17], page 1108. 
Given that, for all t £ [0, 1], 

dx 



A t '(t) = -Ai o (t) / Ce^y) 



and 



k^(t) = A eo (t) / C^Gt 1 -*,^) 



d.c 



x\og(x) ' 
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we can see that 

f 1 dr 
E So {A p (i)W} = -<(i) / C^s 1 -*,*') — 
Jo x 

and 

Ee {A CFG m}=Ae (t) / C^s 1 "*,**) — — . 

JO XLOg[X) 

Interchanging the order of differentiation and integration, we get Ee {A p (t)W} = 
E 9o {A CFG (t)W} = A 9o (t) for all t e (0, 1). 

As for the condition Ee o (0W) = J, it can be verified using [17], Proposition 4, for 
the estimators based on maximum pseudo-likelihood and on the inversion of Spearman's 
rho. To handle the estimator based on Kendall's tau, Proposition 5 in [17] must be used 
instead. 



Appendix C: Proof of Proposition 2 

The proof closely mimics the argument presented in [19], Appendix B. To avoid dupli- 
cation, the same notation is used and only the critical differences are highlighted. This 
also offers an opportunity to correct minor typographical errors in the original source. 

First, consider the process given by B p (i) = n l / 2 {l/A^(t) - l/A%(t)} for all t £ [0, 1] 
and show that B p ~> B = -A P 7 /(A P 7 ) 2 as n -> oo. Then 



,P aP\_ -( A c) 2 ^n 

l + n-Vap p " p ^ c ' 



as a consequence of the functional version of Slutsky's lemma. 
Put k n = 2 log(n + 1) and write 

B p (t) = [ C n (x 1 - t ,x t )— = / 00 C„(e-^ 1 ^,e- st )d s = / 1 , n +/ 2 ,„, 
Jo x Jo 

where, for each t £ [0, 1], 

h,n(t)= Cnie-^^e-^ds, I 2 ,n(t)= C^e-* 1 -*), e~ st ) ds. 



Jk n JO 

The contribution of I\, n {t) is asymptotically negligible because the fact that s > k n 
implies that min(e~ s ^ 1_ *\ e~ st ) < l/(n + 1) and hence that 

\C n (e-« l - t \e-«)\=n l / 2 C(e-< l - t \e-*)<n 1 '*mm^ 

Thus, for all t € [0,1], 

p OO pOQ cy 

\h, n (t)\<n 1/2 J k Cie-'W^dsKn 1 ' 2 ^ c- s ' 2 & S <-^. (A.l) 
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Consequently, the asymptotic behavior of is determined entirely by 7 2 ,n- Invoking 
the Stute representation given by Genest and Segers [19], we may write l2, n = Ji,n + 
Ji,n + Jz.n +o(l), where, for each t £ [0, 1], 

Ji,n(t)= r"a«(c- s(1 -* ) ,e- s *)d S , 



Ja,n(*) = - / n an(e- s ( 1 - t ),l)Ci(e- s ( 1 - t ) ) e- st )d Sj 



Ja,n(t) = - / a„(l,e- s *)C 2 (e- s ( 1 - t ),e- st )d S . 



Here, Ci(u,i>) = dC(u,v)/du, C2(u,v) = dC(u,v)/dv and a„ is the empirical process 
associated with the pairs (F(Xi), G{Y X )), (F(X n ),G(Y n )). 

Fix u £ (0, 1/2) and write = i"(l - t) w for all t £ [0, 1]. Also, let 

K 1 (s,t) = q u {mm(e- s( - l - t \e- st )}, 
K 2 (s,t) = ^(e-< 1 -*))C'i(e- s ( 1 -*),e- si ), 
^ 3 (s,t) = ^(e- st )C 2 (e- s ( 1 - t \e- st ) 

for all s £ (0,oo) and £ € [0, 1]. The proof that J\. n + J 2 , rl + J^ n has the stated limit 
then proceeds exactly as in Appendix B of [19], provided that for i = 1,2,3, there exists 
an integrable function K* : (0,oo) — > K such that Ki(s,t) < K*(s) for all s £ (0,oo) and 

te[o,i]. 

For Ki, this is immediate because Ki(s, t) < e~" s / 2 for all s £ (0, 00) and t £ [0, 1]. For 
K2, the facts that C is LTD and smaller than the Frechet-Hocffding upper bound imply 
that 

C 1 (e- 8 ( 1 -^e- si )<e s ( 1 - t 'C(e- s ( 1 - t ),e^)<e s ( 1 -''miii(e- s ( 1 - t ',e- rf ), 

Now, set m(t) = max(t, 1 - t) and note that g w (c~ s(1_t) ) < e - "^ 1 -') for all s £ (0,oo) 
and t £ [0, 1]. Therefore, 

K 2 (s,t) < c s ( 1 ~")( 1-t 'e -sm( ' t ' < c^i-^Wtlg-™!') _ e -awm(t) < e -us/2 

because m(t) > 1/2 for all i £ [0, 1]. The argument for K3 is similar. 
Turning to the estimator, observe that 



e G (i) = n^{\o g A^ G (t) - log Ag FG (t)} 

| dx 



1 H<r- Z" 00 Ha 

C n ( x i-\ x t)-^— = - / Cn(e -.(i-*) )e -«rt ) £f 
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for all t £ [0, 1]. This process can be written as —(Ii n + h,n + l3,n), where 

r°° H« 
h. n (t)= CnCe-C 1 -*)^-'*)-, 

I 2> „(t)= I"" Cnie-^-Q^-**)-, 

Jin s 

h,n(t)= I"" C^e-^Ke-^)- 
Jo s 

with fc„ = 21og(n + 1) as above and t n = l/(n + 1). 

Arguing as in (A.l), we see that |ii, n | < nT 1 ! 2 . Similarly, I 3 ^ n is negligible asymptoti- 
cally, for if s £ (0,£ n ) and t £ [0, 1], then we have 

s (l_ t ) a u ^ -l/( n +l) 



min(e- 8 ^,e- st )>e-^ n+i J > 



n+1 



and hence C„(c s ^ *\e st ) = 1. Furthermore, the fact that C is LTD implies that 
C(e- s(1 - 4) ,e- st ) > e" s for all s £ (0,oo) and t £ [0, 1]. Therefore, 

IC^e-^-^e" 8 ')! < n l / 2 {\ - e~ s ) < n^s. 

Consequently, |/3, n | < n x l 2 l n < n~ x l 2 . As a result, the asymptotic behavior of B^ FG 
is determined entirely by l2, n - Following Genest and Segers [19], we can further write 
h,n = Ji.n + h, n + Jz,n + o(l), where, for all t £ [0, 1], 

Ji,n(*)= a n (e-'^-*\e-' t )-, 

Jin S 

J 2 ,n(t) = r a n {p-* 1 - t \l)di(e-« 1 - t \e-' t ) — , 
Jin s 

J 8 ,n(*) = - / a„(l,e- s *)C 2 (e- s ( 1 - t ),c- st )-. 

Jin S 

The joint asymptotic behavior of these terms can be determined in the same way as 
before. The only difference is that the integration measure is now ds/s. For s £ [l,oo), 
the same upper bounds K*, K2, K£ apply and they have already been shown to be 
integrable on this domain. To obtain an integrablc bound for K\ on (0, 1), it suffices to 
use the fact that K x {s,t) < (1 - e - sm W) w < {sm(t)}" < s u . The same bound works for 
both K2 and K3 because C% £ [0, 1] for i = 1,2. This completes the argument. 
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